SPEECH RECOGNITION

This invention relates to a method of and an apparatus for speech recognition 
which is robust to missing speech data. It is particularly useful in distributed speech 
recognition in which data is transmitted via a packet switched network. 

Recently there has been an enormous increase in the use of mobile devices such
as mobile phones and personal digital assistants. It is desirable to make the human to
device interface as natural and easy to use as possible. Speech recognition is one
solution which increases naturalness, and overcomes the difficulties in using very small
keyboards found on many mobile devices. A Personal Computer (PC) usually provides
sufficient processing power to operate a speech recogniser. However, on mobile devices
processing power is a limiting factor. One solution is to use distributed speech recognition
(DSR). DSR makes use of remote speech recognisers which are accessed by a device
across a transmission network. Speech data from the device is transmitted across the
network to the remote speech recogniser and the remote speech recogniser processes
the speech to provide a recognition result (or set of results) which is then transmitted back
to the device.

There are basically two types of network across which such information can be 
transmitted; namely connection-orientated networks and connectionless networks. The 
connection-orientated network is essentially the telephony service which has evolved over 
the last 100 years for the switching and transmission of voice data. A connectionless
network is packet-based and its main functionality is the routing and switching of data 
packets from one location to another. 

When a call is made on a connection-orientated network a reservation is made to 
ensure that sufficient network resources are available to sustain the call. This may be the 
allocation of a physical connection or of time slots in a pulse code modulation (PCM)
system. If sufficient resources are not available then the call is refused, typically 
accompanied by an engaged signal. 

The connectionless network is very much aimed at the routing and switching of 
data packets and is designed to efficiently handle the high burstiness of this traffic. 
Packets comprise two parts - a header and a payload. The header contains
information regarding the source and destination address while the payload contains the 
actual data which needs to be sent.

For transmitting real-time data such as speech, the essential difference between
the two networks is that the connection-orientated network reserves sufficient capacity, or
bandwidth, to maintain a connection throughout the call. With a connectionless network
sufficient bandwidth is not guaranteed which means that the network may produce delays
or missing packets which interrupt the data transmission. Therefore the
connection-orientated network is much better suited to delivering real-time data. Voice has therefore
traditionally been transmitted using connection-orientated networks. However, because of
the enormous growth in data networks, the technique of Voice over Internet Protocol
(VoIP) has been developed to allow the real-time transmission of voice signals across
connectionless networks.

In a connectionless network the packets containing the speech can be routed 
across a wide variety of paths depending on the network traffic. Indeed, it may be that 
successive packets are routed around the network on different paths. As a result it is 
possible that some packets arrive out of sequence or may never even arrive. This is 
clearly undesirable in a DSR system as it will introduce recognition errors. An approach to
dealing with this problem of missing packets is to use protocols designed specifically for 
real-time data which ensure all the data arrives with minimal delay. 

The traditional connectionless network is termed best-effort. This means that
packets from a source are sent to a destination with no guarantee of a timely delivery. For
applications such as file transfer which require a guarantee of delivery, Transmission
Control Protocol (TCP) is able to trade packet delay for guaranteed reception. In the event
of lost packets TCP allows for the destination to request the retransmission of those lost
packets. However, for real-time data it is important to minimise transmission delays. It is
therefore impracticable to use TCP and wait for the retransmission of lost packets. A
better approach is to use User Datagram Protocol (UDP) as the protocol for sending the
packets. The receiver typically uses a short duration buffer which allows for slight delays in
packet arrival, after which the packet is assumed not to be going to arrive. UDP provides no
facility for the retransmission of lost packets. This has the advantage that delays are
minimised but at the expense of possibly losing some of the speech signal when network
traffic is high and packet loss is probable.

Protocols designed specifically for real-time data transmission include Resource 
Reservation Protocol (RSVP). This is a signalling protocol which reserves network 
resources at the start of a call to ensure that a direct connection to the destination is 
available throughout. In effect it makes a connection-orientated path from a
connectionless network. In order for this to function all the routers in the network from the
source to destination must be RSVP enabled. As RSVP is a relatively new protocol not all 
routers are equipped with this facility. 

Another protocol designed specifically for real-time data transmission is DiffServ. 
This makes use of a byte of data in the packet header to specify a Type of Service (ToS)
- i.e. how much priority should be given to the immediate routing of that packet through
the network. Clearly some data will have very high priority such as network management
and system commands. Lower priority will be given to file transfer and email where
immediate delivery is not too important. Depending on the emphasis given to the network,
high priority can be given to speech packets to assist real-time use. Again, this protocol is
only in development and not available generally. 

The increase of connectionless voice networks, coupled with the increase in
automation of call centres means that the ability to perform robust speech recognition over 
a connectionless network is becoming more important. 

An alternative approach to ensuring that all packets containing the speech signal
successfully reach the speech recogniser is to make the recogniser itself robust to missing
packets. When the packet loss is low (<5%) the drop in recognition performance is not too
significant. However, as packet loss increases - or occurs in bursts - the effect is more
detrimental. Therefore, a speech recogniser is required which is able to tolerate this loss
of speech.

Known signal processing techniques which deal with missing packets range from very 
simple to complex - a good review is made in C. Perkins, O. Hodson and V. Hardman, "A 
survey of packet loss recovery techniques for streaming audio", IEEE Network Magazine, 
Vol.12, No. 5, pp. 40-48, October 1998. Simple techniques include splicing which merely 
joins the speech signal together either side of the gap. Silence and noise substitution
replace the missing frames of speech with either silence or noise. Repetition replaces the 
lost frames of speech with copies of the speech which arrived before the gap. 

More sophisticated techniques attempt to estimate the missing parts of the signal 
from those parts which have been correctly received. These include waveform substitution 
which uses the pitch on either side of the gap to estimate the missing speech. Time scale
modification stretches the audio signal either side of the gap to fill in the missing speech. 
Regeneration-based repair uses parameters of the codec to determine the required fill-in 
speech. All these techniques attempt to reconstruct the time-domain speech signal.

According to the present invention there is provided a method of speech
recognition comprising the steps of receiving a sequence of transmitted feature vectors,
said feature vectors representing a speech signal; detecting the absence of a feature
vector in the received sequence; generating an estimated replacement feature vector for
the detected absent feature vector; inserting said replacement feature vector into the
received feature vector sequence to provide a modified feature vector sequence; and
performing speech recognition upon the modified feature vector sequence.

Preferably the feature vector comprises a plurality of components and the 
generating step comprises estimating a component of a replacement feature vector by 
interpolating the corresponding component of a received feature vector.

In a preferred embodiment the estimating step uses an interpolation coefficient
corresponding to a component of the received feature vector, and the method further
comprises the step of updating the interpolation coefficient in response to one or more
received feature vectors.

In an alternative embodiment of the invention the transmitted feature vectors
include features in a cepstral domain, and in which the estimating step comprises the sub 
steps of converting a received feature vector to a spectral domain; estimating a spectral 
component by interpolating the corresponding component of the converted feature vector; 
and converting the estimated spectral component to said cepstral domain. 

According to another aspect of the invention there is provided a device for
performing speech recognition upon a sequence of parameterised feature vectors
comprising a missing feature vector detector arranged in operation to receive the
transmitted feature vectors and to indicate the absence of a feature vector in the received
sequence; a feature vector estimator arranged, in operation, to receive transmitted feature
vectors and responsive to said indication from the missing feature vector detector to
estimate a replacement feature vector; a sequence reconstructor arranged, in operation,
to receive transmitted feature vectors and to receive a replacement feature vector and to
provide as an output a modified feature vector sequence; and a speech recogniser
arranged, in operation, to receive the modified feature vector sequence.

Preferably the feature vector estimator comprises an interpolator arranged to
receive a feature vector and to provide as an output a component of the replacement 
feature vector.

In a preferred embodiment the interpolator uses an interpolation coefficient
corresponding to a component of the received feature vector and in which the interpolator 
is arranged to update the interpolation coefficient in response to receipt of a feature 
vector. 

In an alternative embodiment of the invention the feature vector estimator
comprises a first converter for converting a received feature vector to a spectral domain;
an estimator for estimating a spectral component by interpolating the
corresponding component of the converted frame; and a second converter for converting the
estimated spectral component to said cepstral domain.

A data carrier loadable into a computer and carrying instructions for causing the
computer to carry out a method according to the invention and a data carrier loadable into 
a computer and carrying instructions for enabling the computer to provide the device 
according to the invention are also provided. 

Embodiments of the invention will now be described, by way of example only, 
with reference to the accompanying drawings in which:

Figure 1 is a schematic representation of a computer loaded with software embodying the 
present invention; 

Figure 2 is a functional block diagram showing program elements for software embodying 
a known technique for performing DSR; 

Figure 3 is a functional block diagram showing program elements for software embodying
another known technique for performing DSR; 

Figure 4 is a functional block diagram of program elements that comprise a parameteriser 
of Figures 2 and 3; 

Figure 5 is a functional block diagram of the program elements that comprise the software 
indicated in Figure 1;

Figure 6 is a functional block diagram of the program elements which comprise a feature 
vector regenerator of Figure 5; 

Figure 7 is a functional block diagram of the program elements that comprise a frame 
estimator shown in Figure 6 in one embodiment of the invention; and 

Figure 8 is a functional block diagram of the program elements that comprise the frame
estimator shown in Figure 6 in a second embodiment of the invention.

Figure 1 illustrates a conventional computer 101, such as a PC, running a
conventional operating system 103, such as Windows (a Registered Trade Mark of
Microsoft Corporation), and having a number of resident application programs 105
including a word processing program, a network browser and e-mail program and a
database management program. The computer 101 is also connected to a conventional
disc storage unit 111 for storing data and programs, a keyboard 113 and mouse 115 for
allowing user input and a printer 117 and display unit 119 for providing output from the
computer 101. The computer 101 also has access to external networks (not shown) via a
network card 121. The computer 101 also includes a speech recognition program 109 that
enables a speech signal received via the network card 121 to be recognised.

In Figure 2, a mobile device 201 includes a framer 205 which divides a received
speech signal into short duration frames, for example 30ms, and sends the resultant
frames to an encoder 202. The encoder 202 encodes each frame of received speech into
a suitable coded representation, for example using the standard codec defined in
ITU-T G.723.1, and the resultant coded representation is sent to a packetiser 203. The coded
representation forms the payload for a packet (not shown) which has a header added by
the packetiser 203. The packet is transmitted via a connectionless network 206 to a
remote device 204. The remote device 204 includes an unpacketiser 207 which removes
the header, and a decoder 208 which decodes the coded representation of the speech
frame. Speech frames are sent from the decoder 208 to an audio reconstructor 209 where
the speech signal is reconstructed. The speech signal is then parameterised by a
parameteriser 210 to form feature vectors suitable for use by a speech recogniser 211.
The parameteriser 210 comprises a basic feature extractor 213 and a feature processor
212, the operation of which will be described later.

Figure 3 shows a system for DSR which avoids encoding and decoding the
speech signal by transmitting parameterised speech signals over the
network 206. In Figure 3 a device 201' includes a basic feature extractor 213' which
parameterises speech signals to form feature vectors. The speech feature vectors are
packetised by the packetiser 203 and transmitted via the network 206 to a remote device
204'. At the remote device the features are unpacketised by the unpacketiser 207 and
transmitted to the speech recogniser 211 via the feature processor 212. This approach is
advantageous over the system shown in Figure 2. Encoding and decoding the signal
causes a degradation in quality of the speech signal. This causes a significant reduction in
speech recognition performance. By parameterising the speech signal before
transmission across the network there is no resultant loss in speech recognition
performance. As encoding and decoding of the speech signal is not required there is a
significant saving in computation.

The problem of missing packets needs to be addressed. In this invention 
reconstructing the transmitted feature vector sequence is performed by detecting missing
feature vectors and subsequently estimating corresponding replacement feature vectors. 

Figure 5 shows a remote device 204" according to the invention, which is 
implemented on a conventional computer as illustrated in Figure 1. After received features
have been unpacketised by the unpacketiser 207, missing features are restored by a
feature vector regenerator 214.

In the embodiment of the invention described here the basic feature vectors used 
are Mel-frequency cepstral coefficients (MFCCs). MFCCs are generated from a received 
speech signal as illustrated in Figure 4. A high emphasis filter 10, normally referred to as a 
pre-emphasis filter, receives a digitised speech waveform at, for example, a sampling rate of 
8 kHz as a sequence of 8-bit numbers and performs a high emphasis filtering process (for
example by executing a $1 - 0.95z^{-1}$ filter), to increase the amplitude of higher frequencies.

A sequence of 256 contiguous samples (referred to as a frame in this description) 
of the filtered signal is windowed by a window processor 11 in which the samples are
multiplied by predetermined weighting constants using, in this embodiment of the
invention, a Hamming window, to reduce spurious artefacts generated at the edges of the
frame. Each frame overlaps with neighbouring frames by 50%, so as to provide one frame 
every 16ms. 
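
The pre-emphasis, framing and windowing steps described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions rather than the implementation of the embodiment: the function names `pre_emphasis`, `hamming` and `windowed_frames` are hypothetical, the standard Hamming coefficients 0.54 and 0.46 are assumed, and the first sample is assumed to pass through the filter unchanged.

```python
import math


def pre_emphasis(samples, coeff=0.95):
    """Apply y[n] = x[n] - coeff * x[n-1] (the 1 - 0.95 z^-1 filter).

    Assumption: the sample before the first one is taken to be zero.
    """
    out = []
    prev = 0.0
    for x in samples:
        out.append(x - coeff * prev)
        prev = x
    return out


def hamming(n_points):
    """Standard Hamming window weights (assumed form of the weighting constants)."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * n / (n_points - 1))
            for n in range(n_points)]


def windowed_frames(samples, frame_len=256, overlap=0.5):
    """Split samples into overlapping frames and apply the window to each.

    With frame_len=256 and 50% overlap the hop is 128 samples,
    i.e. one frame every 16 ms at a sampling rate of 8 kHz.
    """
    hop = int(frame_len * (1.0 - overlap))
    window = hamming(frame_len)
    result = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        chunk = samples[start:start + frame_len]
        result.append([s * w for s, w in zip(chunk, window)])
    return result
```

For example, 512 filtered samples yield three frames, starting at samples 0, 128 and 256.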

Each frame of 256 windowed samples is then processed by a MFCC generator 
12 to extract an MFCC feature vector comprising nine MFCCs.

The MFCC feature vector is derived by performing a spectral transform, in this
embodiment of the invention, a Fast Fourier Transform (FFT), on each frame of a speech
signal, to derive a representation of the signal spectrum for each frame of speech. The
terms of the spectrum are integrated into a series of broad bands, which are distributed on
a 'mel-frequency' scale along the frequency axis, to provide nineteen mel-frequency
features. These features are referred to as filterbank features in this description. The
mel-frequency scale is a perceptually motivated scale, which comprises frequency bands
evenly spaced on a linear frequency scale between 0 and 1 kHz, and evenly spaced on a
logarithmic frequency scale above 1 kHz. The logarithm of each mel-frequency feature is
calculated and then a Discrete Cosine Transform (DCT) is performed to generate an
MFCC feature vector for the frame. Features such as the mel-frequency features
described above, which represent the frequencies within a signal, are referred to as being
features in a spectral domain. Features which represent the rate of change of frequencies
in a signal, such as the MFCCs described above, are referred to as being in a cepstral
domain.
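
The final two stages of this derivation, the logarithm followed by the DCT, can be illustrated with a short Python sketch. It assumes the nineteen filterbank features have already been computed from the FFT spectrum, uses an unnormalised DCT-II (the description does not state a scaling), and keeps nine coefficients; the names `dct_ii` and `filterbank_to_mfcc` are hypothetical.

```python
import math


def dct_ii(values, n_out):
    """Unnormalised DCT-II, returning only the first n_out coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(n_out)]


def filterbank_to_mfcc(filterbank, n_ceps=9):
    """Take the log of each (positive) filterbank feature, then apply the DCT.

    Assumption: n_ceps=9 lower-order coefficients are retained.
    """
    log_energies = [math.log(e) for e in filterbank]
    return dct_ii(log_energies, n_ceps)
```

A flat filterbank vector produces energy only in the zeroth coefficient, which reflects the fact that the higher coefficients describe the shape of the log spectrum rather than its overall level.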

For MFCCs it is found that the useful information is generally confined to the
lower order coefficients, so in this embodiment of the invention nine cepstral coefficients 
are used. 

Before the features are transmitted to the feature processor 212, any missing
front end feature vectors are restored by the feature vector regenerator illustrated in 
Figure 6. 

Estimation of missing feature vectors is a simpler problem than estimation of the 
original time-domain speech signal. Feature vectors are highly correlated with one another 
in time, and represent a longer portion of speech than a single digital time-domain sample.
In the embodiment of the invention described here 256 time-domain samples are
represented by 9 MFCCs. The estimation of 9 MFCCs which are highly correlated with
preceding and following MFCCs is much simpler than accurate estimation of 256
samples. 

In Figure 6 a stream of MFCCs is shown with one missing feature vector due to
the packet having that feature vector as part of its payload being lost in the network 206.
Received MFCCs are first passed into a missing feature vector detector 501 which
identifies whether any feature vectors are missing or not. If a missing feature vector is
detected a feature vector estimator 502 is used to estimate the missing feature vector.
The feature vector sequence is then reconstructed by the sequence reconstructor 503.
The resultant reconstructed sequence may then be used for speech recognition in the
usual way.

In the embodiment of the invention described here the missing feature vector 
detector 501 uses feature vector numbering. An additional feature is added to the feature 
vector by the basic feature extractor 213'; the additional feature indicates the position of
each feature vector in the feature vector sequence. At the remote device 204" the missing 
feature vector detector 501 checks the feature vector number of each feature vector 
received and uses this number to detect whether there are any missing feature vectors,
and if so how many. When one or more missing feature vectors are detected a signal is 
sent to the feature vector estimator 502. 
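
Detection by feature vector numbering can be sketched as below. This is an illustrative sketch only: it assumes the numbers arrive in increasing order and do not wrap around, and the function name `find_missing` is hypothetical.

```python
def find_missing(sequence_numbers):
    """Return a list of (last_received_number, count_missing) for each gap.

    Assumptions: numbers are integers in increasing order with no wrap-around.
    """
    gaps = []
    for prev, cur in zip(sequence_numbers, sequence_numbers[1:]):
        if cur - prev > 1:
            # cur - prev - 1 feature vectors were lost between prev and cur
            gaps.append((prev, cur - prev - 1))
    return gaps
```

For the received numbers 0, 1, 2, 4, 5, 8 the function reports one missing vector after number 2 and two missing vectors after number 5, which is the information passed on to the estimator.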

The feature vector estimator 502 uses interpolation to estimate the missing 
speech features. Each feature is estimated separately and the time series of each is used 
to form a polynomial which enables missing elements to be estimated. In this embodiment
of the invention a simple straight line interpolation is used. A detailed description of 
interpolation algorithms is provided in S.V. Vaseghi, "Advanced signal processing and 
digital noise reduction", John Wiley, 1996.

Figure 7 illustrates interpolation of an MFCC feature vector. For each feature in the
feature vector a corresponding interpolator 601, 602, ... 609 is established. As each new
feature vector arrives the interpolation coefficients for each feature of the feature vector
are updated. When a missing feature vector is detected by the missing feature vector
detector 501 an estimate of the missing feature vector is made using the interpolators
601, 602, ... 609. The estimate of the missing frame is then inserted into the feature vector
sequence by the sequence reconstructor 503.
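
A straight line interpolation of each feature across a gap can be sketched as follows. The sketch assumes that the feature vectors received immediately before and after the gap are both available when the estimate is made, which is one possible arrangement and not necessarily that of the interpolators of Figure 7; the function name `interpolate_gap` is hypothetical.

```python
def interpolate_gap(before, after, n_missing):
    """Estimate n_missing feature vectors lying between two received vectors.

    Each component is interpolated separately along a straight line
    from its value in `before` to its value in `after`.
    """
    estimates = []
    for j in range(1, n_missing + 1):
        t = j / (n_missing + 1)  # fractional position of the j-th missing vector
        estimates.append([b + t * (a - b) for b, a in zip(before, after)])
    return estimates
```

With a single missing vector the estimate is simply the component-wise midpoint of its two neighbours.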

In another embodiment of the invention the MFCC feature vectors, which are in
the cepstral domain, are converted back into the spectral domain so that interpolation is
performed on features which represent the frequencies in the original signal. Upon
detection of a missing feature vector the interpolator produces an estimate of a filterbank
feature vector. The logarithm of this estimate is then taken and a DCT applied to transform
the estimate into the MFCC domain. This is illustrated in Figure 8 in which a sequence of
received MFCC feature vectors 701 has an inverse DCT applied to it at 702, and is
exponentiated at 703 (i.e. the inverse of a logarithm is applied) to provide a sequence of
filterbank feature vectors. A filterbank interpolator 705 is used to provide a filterbank
estimate 706 of a missing feature vector, and the filterbank estimate 706 has a logarithm
calculated at 707 and a DCT applied at 708 to provide an MFCC estimate 709.
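
The chain of Figure 8 (inverse DCT, exponentiation, interpolation, logarithm, DCT) can be sketched in Python as below. Several details are assumptions made for illustration: an unnormalised DCT-II with its matching inverse, nineteen filterbank channels, a single missing vector estimated as the midpoint of its two neighbours, and the hypothetical names `dct`, `idct`, `mfcc_to_filterbank` and `estimate_missing_mfcc`. Because only the lower-order coefficients are transmitted, the recovered filterbank vector is a smoothed version of the original.

```python
import math

N_BANDS = 19  # number of filterbank channels (assumed from the description)


def dct(values, n_out):
    """Unnormalised DCT-II, first n_out coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * k * (i + 0.5) / n)
                for i, v in enumerate(values))
            for k in range(n_out)]


def idct(coeffs, n):
    """Inverse of dct() for a truncated coefficient list, giving n values."""
    return [coeffs[0] / n
            + (2.0 / n) * sum(c * math.cos(math.pi * k * (i + 0.5) / n)
                              for k, c in enumerate(coeffs) if k > 0)
            for i in range(n)]


def mfcc_to_filterbank(mfcc):
    """Inverse DCT followed by exponentiation (steps 702 and 703)."""
    return [math.exp(v) for v in idct(mfcc, N_BANDS)]


def estimate_missing_mfcc(before, after):
    """Interpolate in the filterbank domain, then return to the MFCC domain."""
    fb_before = mfcc_to_filterbank(before)
    fb_after = mfcc_to_filterbank(after)
    fb_estimate = [0.5 * (b + a) for b, a in zip(fb_before, fb_after)]
    log_estimate = [math.log(e) for e in fb_estimate]  # step 707
    return dct(log_estimate, len(before))              # step 708
```

Interpolating in the filterbank domain gives a different result from interpolating the cepstral coefficients directly, because the logarithm is non-linear: the midpoint of two spectra is not the spectrum whose log is the midpoint of the two logs.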

After the feature vector sequence has been reconstructed by the feature vector
regenerator 214, processing of the basic features prior to recognition is performed by the
feature processor 212. RASTA filtering is applied by bandpass filtering the time series of
feature vectors. A detailed description of RASTA filtering may be found in H. Hermansky
and N. Morgan, "RASTA processing of speech", IEEE Trans. Speech and Audio Proc.,
vol. 2, no. 4, pp. 578-589, October 1994. Any channel distortion is additive in the cepstral
domain, so applying a sharp cut-off highpass filter to each of the features, across time,
removes any offset and hence suppresses channel distortion. Cepstral Time Matrix (CTM)
features are then calculated by taking a DCT across a sequence of seven MFCCs. A
detailed description of CTM features may be found in B.P. Milner, "Inclusion of temporal
information into features for speech recognition", Proc. ICSLP, pp. 256-259, 1996.

It will be appreciated by those skilled in the art that the technique described could 
be applied to other types of basic speech parameterisation. Cepstral features may be
calculated using a Fourier transform as described here, or using linear predictive (LP) 
analysis. It can be proven that the resultant cepstrum from either of these two routes is 
identical. In the embodiment of the invention described here the Fourier transform based 
cepstrum has been modified to include a mel-scale filterbank resulting in MFCCs. 

A process similar to the mel-scale filterbank is used in perceptual linear predictive
(PLP) analysis where a set of critical-band filters are convolved with the speech spectrum. 
These modify the spectrum according to perceptual measurements of human hearing and 
lead to the PLP cepstrum. 

It will also be appreciated by those skilled in the art that other feature vector
processing techniques could be applied, for example differential features may be
calculated, such as 'velocity' and 'acceleration' of the basic features. Cepstral mean
normalisation, in which the average of each feature is subtracted from each feature
respectively, may be used. Linear discriminant analysis (LDA) as described in E.J. Paris
and M.J. Carey, "Estimating linear discriminant parameters for continuous density HMMs",
Proc. ICSLP, pp. 215-218, 1994 may also be used.
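
As an illustration of one of these options, cepstral mean normalisation can be sketched as follows. The sketch assumes the average is taken over all feature vectors of an utterance, which the description does not specify; the function name `cepstral_mean_normalise` is hypothetical.

```python
def cepstral_mean_normalise(vectors):
    """Subtract the per-feature average from every feature vector.

    Assumption: the average is computed over the whole list of vectors.
    """
    n = len(vectors)
    dims = len(vectors[0])
    means = [sum(v[d] for v in vectors) / n for d in range(dims)]
    return [[v[d] - means[d] for d in range(dims)] for v in vectors]
```

After normalisation each feature has zero mean across the sequence, so any constant offset on a feature is removed.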

As will be understood by those skilled in the art, the speech recognition program
109 can be contained on various transmission and/or storage media such as a floppy
disc, CD-ROM, or magnetic tape so that the program can be loaded onto one or more
general purpose computers or could be downloaded over a computer network using a
suitable transmission medium.

Unless the context clearly requires otherwise, throughout the description and the 
claims, the words "comprise", "comprising" and the like are to be construed in an inclusive 
as opposed to an exclusive or exhaustive sense; that is to say, in the sense of "including, 
but not limited to".

CLAIMS

1. A method of speech recognition comprising the steps of

receiving a sequence of transmitted feature vectors, said feature vectors
representing a speech signal;

detecting the absence of a feature vector in the received sequence;

generating an estimated replacement feature vector for the detected absent
feature vector;

inserting said replacement feature vector into the received feature vector
sequence to provide a modified feature vector sequence; and

performing speech recognition upon the modified feature vector sequence.

2. A method according to claim 1, in which the feature vector comprises a plurality 
of components and the generating step comprises 

estimating a component of a replacement feature vector by interpolating the 
corresponding component of a received feature vector. 

3. A method according to claim 2, in which the estimating step uses an interpolation
coefficient corresponding to a component of the received feature vector and further
comprising the step of

updating the interpolation coefficient in response to one or more
received feature vectors.

4. A method according to claim 1, in which the transmitted feature vectors include
features in a cepstral domain, and in which the estimating step comprises the sub steps of

converting a received feature vector to a spectral domain;

estimating a spectral component by interpolating the corresponding component of
the converted feature vector; and

converting the estimated spectral component to said cepstral domain.

5. A device for performing speech recognition upon a sequence of parameterised 
feature vectors comprising 

a missing feature vector detector arranged in operation to receive the transmitted 
feature vectors and to indicate the absence of a feature vector in the received sequence;

a feature vector estimator arranged, in operation, to receive transmitted feature
vectors and responsive to said indication from the missing feature vector detector to
estimate a replacement feature vector;

a sequence reconstructor arranged, in operation, to receive transmitted feature
vectors and to receive a replacement feature vector and to provide as an output a
modified feature vector sequence; and

a speech recogniser arranged, in operation, to receive the modified feature vector 
sequence. 

6. A device according to claim 5, in which the feature vector estimator comprises 

an interpolator arranged to receive a feature vector and to provide as an
output a component of the replacement feature vector.

7. A device according to claim 6, in which the interpolator uses an interpolation 
coefficient corresponding to a component of the received feature vector and in which the 
interpolator is arranged to update the interpolation coefficient in response to receipt of a
feature vector.

8. A device according to claim 6, in which the feature vector estimator comprises 

a first converter for converting a received feature vector to a spectral domain; 

an estimator for estimating a spectral component by interpolating the 
corresponding component of the converted frame; 

a second converter for converting the estimated spectral component to said
cepstral domain.

9. A data carrier loadable into a computer and carrying instructions for causing the 
computer to carry out the method according to any one of claims 1 to 4. 

10. A data carrier loadable into a computer and carrying instructions for enabling the 
computer to provide the device according to any one of claims 5 to 8.

11. A method of speech recognition substantially as described herein with reference 
to Figures 5 to 8. 

12. A device for recognising speech substantially as described herein with reference 
to Figures 5 to 8. 

ABSTRACT
Speech Recognition 



A speech recogniser suitable for distributed speech recognition is robust to missing 
speech feature vectors. Speech is transmitted via a packet switched network in the form
of basic feature vectors. Missing feature vectors are detected and replacement feature 
vectors are estimated by interpolation of received data prior to speech recognition. In an 
improved version features are converted and interpolation is done in a spectral domain. 

Figure 5